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1 Consistency in a partitioned network: a survey 
Susan B. Davidson, Hector Garcia-Molina, Dale Skeen 
September 1985 ACM Computing Surveys (CSUR), Volume 17 issue 3 
Publisher: ACM Press 

Additional Information: fuil citation , abstract , references , citings , index 
terms , review 



Full text available: "pi pdf(3.2Q MB) 



Full text available: g pdf(2.37 MB) 



Recently, several strategies have been proposed for transaction processing in partitioned 
distributed database systems with replicated data. These strategies are surveyed in light 
of the competing goals of maintaining correctness and achieving high availability. 
Extensions and combinations are then discussed, and guidelines are presented for 
selecting strategies for particular applications. 

2 Monitoring distributed systems 

Jeffrey Joyce, Greg Lomow, Konrad Slind, Brian Unger 

March 1987 ACM Transactions on Computer Systems (TOCS), volume 5 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citing s, index 
terms , review 

The monitoring of distributed systems involves the collection, interpretation, and display 
of information concerning the interactions among concurrently executing processes. This 
information and its display can support the debugging, testing, performance evaluation, 
and dynamic documentation of distributed systems. General problems associated with 
monitoring are outlined in this paper, and the architecture of a general purpose, 
* extensible, distributed monitoring system is presented. Three a ... 

The LHAM log-structured history data access method 
Peter Muth, Patrick O'Neil, Achim Pick, Gerhard Weikum 

February 2000 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 8 Issue 3-4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: Q pdf(494.76 KB) Additional Information: full citation , abstract , citin gs, index terms 

Numerous applications such as stock market or medical information systems require that 
both historical and current data be logically integrated into a temporal database. The 
underlying access method must support different forms of "time-travel" queries, the 
migration of old record versions onto inexpensive archive media, and high insertion and 
update rates. This paper presents an access method for transaction-time temporal data, 
called the log-structured history data access method (L ... 



Keywords: Data warehouses, Index structures, Performance, Storage systems, Temporal 
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4 A logical semantics for object-oriented databases j 
Jose Meseguer, Xiaolei Qian 

June 1993 ACM SIGMOD Record , Proceedings of the 1993 ACM SIGMOD international 

conference on Management of data SIGMOD '93, volume 22 issue 2 
Publisher: ACM Press 

Full text available* f*1 pdf(1.15 MB) Additional Information: full citation , abstract , references , citings, index 
' ' terms 

Although the mathematical foundations of relational databases are very well established, 
the state of affairs for object-oriented databases is much less satisfactory. We propose a 
semantic foundation for object-oriented databases based on a simple logic of change 
called rewriting logic, and a language called MaudeLog that is based on that logic. Some 
key advantages of our approach include its logical nature, its simplicity without any need 
for higher-order features, the fac ... 

5 Adaptive joint multiuser detection and channel estimation in multipath fadin g CDMA [ 
channels 

Xiaodong Wang, H. Vincent Poor 

November 1998 Wireless Networks, volume 4 issue 6 

Publisher: Kluwer Academic Publishers 

Full text available' t 5 ! pdf(387.24 KB) Actional Information: full citation , abstract , references , citings , index 

terms 

The problem of joint multiuser detection and channel estimation in frequency-selective 
Rayleigh fading CDMA channels is considered. First the optimal multiuser detector for 
such channels is derived, which is seen to have a computational complexity exponential in 
the product of the number of users and the length of the transmitted data sequence. Two 
suboptimal detectors are then developed and analyzed, both of which employ 
decorrelating filters at the front-ends to eliminate the multiple-acc ... 

6 Documentation Project: Academic software resources | 
Carole (Kelly) Havens 

November 1982 Proceedings of the 10th annual ACM SIGUCCS conference on User 

services SIGUCCS '82 
Publisher: ACM Press 

Full text available: ^ pdf(246.13 KB) Additional Information: full citation , abstract , index terms 

Over a period of years, an academic user services group can go through enough 
personnel changes to cause a loss in continuity in documentation. As a result, hundreds of 
inadequately documented resources can accumulate, become obsolete, or come to exist 
in multiple copies and versions. User Services may find themselves having to rely on 
individual users or former staff to supply information on the more obscure resources. 
Their task, therefore, becomes the creation of a data bank of informati. ... 

The dangers of replication and a solution | 
Jim Gray, Pat Helland, Patrick O'Neil, Dennis Shasha 

June 1996 ACM SIGMOD Record , Proceedings of the 1996 ACM SIGMOD international 

conference on Management of data SIGMOD '96, volume 25 issue 2 
Publisher: ACM Press 

Full text available* 151 df(1 22 MB) Additional Information: full citation , abstract , references , citings, index 

terms 

Update anywhere-anytime-anyway transactional replication has unstable behavior as the 
workload scales up: a ten-fold increase in nodes and traffic gives a thousand fold increase 
in deadlocks or reconciliations. Master copy replication (primary copy) schemes reduce 
this problem. A simple analytic model demonstrates these results. A new two-tier 
replication algorithm is proposed that allows mobile (disconnected) applications to 
propose tentative update transactions that are later applied to a mast ... 
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8 Revokable and versatile electronic money (extended abstract) 
Markus Jakobsson, Moti Yung 

January 1996 Proceedings of the 3rd ACM conference on Computer and 
communications security CCS '96 

Publisher: ACM Press 

Full text available: g pdf d .53 MB) Additional Information: full citation , references , citings, index terms 

J 



9 Multiversion diver g ence control of time fuzziness 

^ Calton Pu, Miu K. Tsang, Kun-Lung Wu, Philip S. Yu 

November 1994 Proceedings of the third international conference on Information and 

knowledge management CIKM '94 
Publisher: ACM Press 

Full text available: © pdf(980.75 KB) Additional Information: fall citation , abstract, references , citings, index 
L - terms 

Epsilon Serializability (ESR) has been proposed to manage and control inconsistency in 
extending the classic transaction processing. ESR increases system concurrency by 
tolerating a bounded amount of inconsistency. In this paper, we present multiversion 
divergence control (mvDC) algorithms that support ESR with not only value but also time 
fuzziness in multiversion databases. Unlike value fuzziness, accumulating time fuzziness is 
semantically different. As... 

10 A class of replacement policies for medium and hi g h-associativity structures 
Yannick Deville, Jean Gobert 

March 1992 ACM SIGARCH Computer Architecture News, Volume 20 issue l 
Publisher: ACM Press 

Full text available: *Q pdf (1 .03 MB) Additional Information: full citation , abstract , citings, index terms 

The content of set associative and fully associative structures (such as cache memories, 
TLBs and main memories) is controlled by a replacement algorithm. Replacing the 
elements that have not been accessed for a long period yields high performance. This 
property has been used in the LRU policy, it is also used in this paper, in order to define a 
new class of replacement policies that provide two improvements over LRU : 1) they 
exhibit higher performance, 2) they have a lower complexity of implem ... 

11 Tuning garbage collection for reducing memory system energy in an embedded java ' 
environment 

G. Chen, R. Shetty, M. Kandemir, N. Vljaykrishnan, M. J. Irwin, M. Wolczko 
November 2002 ACM Transactions on Embedded Computing Systems (TECS), Volume l 
Issue 1 

Publisher: ACM Press 

Full text available- 1 ^1 pdf(740 23 KB) Add it' onal Information: full citation , abstract , references , citings , index 
• \a^.u : terms 

Java has been widely adopted as one of the software platforms for the seamless 
integration of diverse computing devices. Over the last year, there has been great 
momentum in adopting Java technology in devices such as cellphones, PDAs, and pagers 
where optimizing energy consumption is critical Since, traditionally, the Java virtual 
machine (JVM), the cornerstone of Java technology, is tuned for performance, taking into 
account energy consumption requires reevaluation, and possibly redesign oft ... 

Keywords: Garbage collector, Java Virtual Machine (JVM), K Virtual Machine (KVM), low 
power computing 



12 Dribble posting a master file 
Robert V. Head 

February 1966 Communications of the ACM, volume 9 issue 2 
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Publisher: ACM Press 
Full text available: *g) pdf(380.Q1 KB) Additional Information: full citation , abstract 

Many business applications employ sequential magnetic tape rather than random-access 
storage techniques to process a very small number of transactions against a voluminous 
master file. In such situations, it may prove economical to avoid creating a new master 
file during each updating run by producing instead a dribble ledger containing only those 
master file accounts which have experienced activity. 

13 Software for simulation 
Jerry Banks 

November 1996 Proceedings of the 28th conference on Winter simulation WSC '96 
Publisher: ACM Press, IEEE Computer Society 

Full text available:^ pdf( 9Q8.02 KB) Additional Information: full citation , abstract , references , citings 

This tutorial describes software for conducting computer simulation other software that 
supports simulation. 

14 A relational model of data for large shared data banks 
E. F. Codd 

June 1970 Communications of the ACM, volume 13 issue 6 
Publisher: ACM Press 

Full text available: 'jg pdf(1.27 MB) Additional Information: full citation , abstract , references , citings 

Future users of large data banks must be protected from having to know how the data is 
organized in the machine (the internal representation). A prompting service which 
supplies such information is not a satisfactory solution. Activities of users at terminals and 
most application programs should remain unaffected when the internal representation of 
data is changed and even when some aspects of the external representation are changed. 
Changes in data representation will often be needed as a ... 

Keywords: composition, consistency, data bank, data base, data integrity, data 
organization, data structure, derivability, hierarchies of data, join, networks of data, 
predicate calculus, redundancy, relations, retrieval language, security 



15 Authentication and si g nature schemes: On the performance , feasibilit y, a nd use o f 

forward-secure si g natures 
^ Eric Cronin, Sugih Jamin, Tal Malkin, Patrick McDaniel 

October 2003 Proceedings of the 10th ACM conference on Computer and 
communications security CCS '03 

Publisher: ACM Press 

Full text available- fi3 pdf(386 51 KB) Additional Information: full citation , abstract , references , citing s, index 

terms 

Forward -secure signatures (FSSs) have recently received much attention from the 
cryptographic theory community as a potentially realistic way to mitigate many of the 
difficulties digital signatures face with key exposure. However, no previous works have 
explored the practical performance of these proposed constructions in real-world 
applications, nor have they compared FSS to traditional, non-forward-secure, signatures 
in a non-asymptotic way. We present an empirical evaluation of several FSS sch ... 

Keywords: digital signatures, forward-secure signatures 



16 Trans portation, logistics, and distribution: An object-oriented paradi g m for simulating 
postal distribution centers 

K. Preston White, Brian Barney, Scott Keller, Robert Schwieters, Jacqueline Villasenor, 
William S. Terry, Richard G. Fairbrother, Richard D. Saxton 
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December 2001 Proceedings of the 33nd conference on Winter simulation WSC '01 
Publisher: IEEE Computer Society 

Full text available- fiB pdf (31 1 75 KB ) Additional Information: full citation , abstract , references , citings, index 
' ^ " terms 

Discrete-event simulation is an established tool for the design and management of large- 
scale mail sortation and distribution systems. Because the design of distribution facilities 
integrates many of the same or functionally similar components, adopting an object- 
oriented approach to simulation promises significant economies. Instead of coding and 
verifying models de novo for each facility, component subsystem, or individual process, 
object orientation allows engineers to reuse validated ... 

17 A conservative algorithm for computing the flow of permissions in Java programs j| 
Gleb Naumovich 

July 2002 ACM SIGSOFT Software Engineering Notes , Proceedings of the 2002 ACM 
SIGSOFT international symposium on Software testing and analysis ISSTA 

'02, Volume 27 Issue 4 
Publisher: ACM Press 

Full text available: l Q pdf(54Q.Q8 KB) Additional Information: full citation , abstract , references , citings 

Open distributed systems are becoming increasingly popular. Such systems include 
components that may be obtained from a number of different sources. For example, Java 
allows run-time loading of software components residing on remote machines. One 
unfortunate side-effect of this openness is the possibility that "hostile" software 
components may compromise the security of both the program and the system on which 
it runs. Java offers a built-in security mechanism, using which programmers can give p ... 



Keywords: data flow analysis, java, security, static analysis, verification 



18 Simulation of advanced manufacturing systems 
Gerald W. Evans, William E. Biles, Michael W. Golway 

December 1994 Proceedings of the 26th conference on Winter simulation WSC '94 
Publisher: Society for Computer Simulation International 

Full text available: ^ pdf(838.39 KB) Additional Information: full citation , references , citing s, index terms 



19 Use of parallel level 3 BLAS in LU factorization on three vector multiprocessors the 
^ ALLIANT FX/80, the CRAY-2. and the IBM 3090 VF 

^ M. J. Dayde, I. S. Duff 

June 1990 ACM SIGARCH Computer Architecture News , Proceedings of the 4th 
international conference on Supercomputi ng ICS '90, Volume 18 issue 3b 
Publisher: ACM Press 

Full text available* f£|pdf (1 22 MB) Additional Information: full citation , abstract , references , citings, index 

terms 

We show how to transform the B-spline curve and surface fitting problems into suffix 
computations of continued fractions. Then a parallel substitution scheme is introduced to 
compute the suffix values on a newly proposed mesh-of-unshuffle network. The derived 
parallel algorithm allows the curve interpolation through n points to be solved in &Ogr; 
(log n) time using &THgr;n/log n) processors and allows the surface interpolati ... 

20 The relational model for database mana g ement: version 2 

E. F. Codd 
January 1990 Book 

Publisher: Addison-Wesley Longman Publishing Co., Inc. 

Full text available 1 ^) pdf(28 61 MB) Additional Information: full citation , abstract , references , citings, index 

terms , review 
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An important adjunct to precision is a sound theoretical foundation. The relational model 
is solidly based on two parts of mathematics: firstorder predicate logic and the theory of 
relations. This book, however, does not dwell on the theoretical, foundations, but rather on 
all the features of the relational model that I now perceive as important for database 
users, and therefore for DBMS vendors. My perceptions result from 20 y ... 
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Distributed operating systems 
Andrew S. Tanenbaum, Robbert Van Renesse 

December 1985 ACM Computing Surveys (CSUR), Volume 17 issue 4 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms , review 



Full text available: ffl pdf(5.49 MB) 



Distributed operating systems have many aspects in common with centralized ones, but 
they also differ in certain ways. This paper is intended as an introduction to distributed 
operating systems, and especially to current university research about them. After a 
discussion of what constitutes a distributed operating system and how it is distinguished 
from a computer network, various key design issues are discussed. Then several 
examples of current research projects are examined in some detail ... 

An. intrusion tolerant architecture for dynamic content internet servers 
Ayda Saidane, Yves Deswarte, Vincent Nicomette 

October 2003 Proceedings of the 2003 ACM workshop on Survivable and self- 
regenerative systems: in association with 10th ACM Conference on 
Computer and Communications Security SSRS '03 

Publisher: ACM Press 

Full text available: ^ pdf(551.49 KB) Additional Information: full citation , abstract , references 

This paper describes a generic architecture for intrusion tolerant Internet servers. It aims 
to build systems that are able to survive attacks in the context of an open network such 
as the Internet. To do so, the design is based on fault tolerance techniques, in particular 
redundancy and diversification. These techniques give a system the additional resources 
to continue delivering the correct service to its legitimate clients even when active attacks 
are corrupting parts of the system compon ... 



Keywords: adaptive redundancy, fault tolerance, intrusion tolerance 



Understanding fauit-tolerant distributed systems 
Flavin Cristian 

February 1991 Communications of the ACM, Volume 34 issue 2 
Publisher: ACM Press 

Full text available: ffl pdf(6.17 MB) Additional Information: full citation , references , citings, index terms , 
• i/jh review 
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^ Andre Schiper, Michel Raynal 

April 1996 Communications of the ACM, Volume 39 Issue 4 
Publisher: ACM Press 

Full text available: < g| pdf(165.96 KB) Additional Information: full citation , references , citings, index terms 



5 A pproaches to fault-tolerant and transactional mobile agent execution— an 
<g> algorithmic view 

• Stefan Pleisch, Andre Schiper 
September 2004 ACM Computing Surveys (CSUR), Volume 36 issue 3 
Publisher: ACM Press 

Full text available: 'gj pdf(946.94 KB) Additional Information: full citation , abstract , references , index terms 

Over the past years, mobile agent technology has attracted considerable attention, and a 
significant body of literature has been published. To further develop mobile agent 
technology, reliability mechanisms such as fault tolerance and transaction support are 
required. This article aims at structuring the field of fault-tolerant and transactional 
mobile agent execution and thus at guiding the reader to understand the basic strengths 
and weaknesses of existing approaches. It starts with a discu ... 

Keywords: ACID, Byzantine failures, agreement problem, asynchronous system, commit, 
crash failures, fault tolerance, malicious places, mobile agents, replication, security, 
transaction 



6 A Survey of Techniques for Synchronization and Recovery in Decentralized 
^ Computer Systems 
^ Walter H. Kohler 

June 1981 ACM Computing Surveys (CSUR), Volume 13 issue 2 
Publisher: ACM Press 

Full text available: fg\ pdf(3.33 MB) Additional Information: full citation , references , citings , index terms 



The relational model for database management: version 2 
E. F. Codd 
January 1990 Book 

Publisher: Addison-Wesley Longman Publishing Co., Inc. 

Full text available: fQ pdf(28 61 MB) Additional Information: full citation, abstract , references , citings, index 
• ■ terms , review 

From the Preface (See Front Matter for full Preface) 

An important adjunct to precision is a sound theoretical foundation. The relational model 
is solidly based on two parts of mathematics: firstorder predicate logic and the theory of 
relations. This book, however, does not dwell on the theoretical foundations, but rather on 
all the features of the relational model that I now perceive as important for database 
users, and therefore for DBMS vendors. My perceptions result from 20 y ... 

Prog ramming languages for distributed computing systems 
Henri E. Bal, Jennifer G. Steiner, Andrew S. Tanenbaum 
September 1989 ACM Computing Surveys (CSUR), Volume 21 issue 3 
Publisher: ACM Press 

Full text available: If) pdf(6.50 MB) Additional Information: full citation, abstract, references , citings, index 
' ^ terms , review 

When distributed systems first appeared, they were programmed in traditional sequential 
languages, usually with the addition of a few library procedures for sending and receiving 
messages. As distributed applications became more commonplace and more 
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sophisticated, this ad hoc approach became less satisfactory. Researchers all over the 
world began designing new programming languages specifically for implementing 
distributed applications. These languages and their history, their underlying pr ... 

9 Continuous learnin g : a desi g n methodology for fault-tolerant neural networks 
Vincenzo Piuri 

June 1990 Proceedings of the 3rd international conference on Industrial and 

engineering applications of artificial intelligence and expert systems - 
Volume 2 IEA/AIE '90 

Publisher: ACM Press 

Full text available: *g pdf(1.36 MB ) Additional Information: full citation , abstract , references , index terms 

Fault tolerance in artificial neural networks is an important feature, in particular when the 
application is critical or when maintenance is difficult. This paper presents a general 
design methodology for designing fault-tolerant architectures, starting from the 
behavioral description of the nominal network and from the nominal algorithm. The 
behavioral level is considered to detect errors due to hardware faults, while system 
survival is guaranteed by the reactivation of learning mechanisms ... 

10 Providing fault-tolerant services to distributed Ada 95 a p plications Q 
y<|v Yvon Kermarrec, Laurent Nana, Laurent Pautet 

>/ December 1996 Proceedings of the conference on TRI-Ada '96: disciplined software 
development with Ada TRI-Ada '96 

Publisher: ACM Press 

Full text available: *g| pdf(837.Q5 KB) Additional Information: full citation , references , citings, index terms 
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11 Totem: a fault-tolerant multicast group communication system 

L. E. Moser, P. M. Melliar-Smith, D. A. Agarwal, R. K. Budhia, C. A. Lingley-Papadopoulos 
April 1996 Communications of the ACM, Volume 39 issue 4 
Publisher: ACM Press 

Full text available- ff\ pdf (342. 07 KB ) Add ' tiona ' Information: full citation , reference s, citings, index terms , 
' utl ' review 



12 Ariadne— an adaptive router for fault-tolerant multicomputer 
^fitv J. D. Allen, P. T. Gaughan, D. E. Schimmel, S. Yalamanchill 

N/ April 1994 ACM SIGARCH Computer Archi tecture News , Proceedings of the 21ST 

annual international symposium on Computer architecture ISCA '94, Volume 
22 Issue 2 

Publisher: IEEE Computer Society Press, ACM Press 

Full text available* pdf(1.21 MB) Additional Information: full citation , abstract, references , citings , index 

terms 

Adaptive routing has been proposed as a means of improving performance and fault- 
tolerance in multicomputer networks. While a number of algorithms have been proposed, 
few adaptive routers have been implemented in hardware. This paper presents the design 
and implementation of Ariadne — a prototype single chip, hardware router. The primary 
motivation is tolerance to link and router failures, while reconciling conflicting demands on 
performance. This is achieved by implementing the m-misro ... 

13 The process group a pproach to reliable distributed com puting 

Kenneth P. Birman 

December 1993 Communications of the ACM, volume 36 issue 12 
Publisher: ACM Press 

Full text available: ^| pdf(6.00 MB ) Additional Information: full citation , references , citings, index terms 
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Keywords: fault-tolerant process groups, message ordering, multicast communication 



14 Prototyping, verification, and test: Reducing pin and area overhead in fault-tolerant 
4k FPGA-based designs 
^ Fernanda Lima, Luigi Carro, Ricardo Reis 

February 2003 Proceedings of the 2003 ACM/SIGDA eleventh international 
symposium on Field programmable gate arrays FPGA '03 

Publisher: ACM Press 

Full text available: pdf(328.13 KB) Additional Information: full citation , abstract , references , index terms 

This paper proposes a new high-level technique for designing fault tolerant systems in 
SRAM-based FPGAs, without modifications in the FPGA architecture. Traditionally, TMR 
has been successfully applied in FPGAs to mitigate transient faults, which are likely to 
occur in space applications. However, TMR comes with high area and power dissipation 
penalties. The proposed technique was specifically developed for FPGAs to cope with 
transient faults in the user combinational and sequential logic, while ... 

Keywords: FPGA, fault-tolerance 



15 Parity logging overcoming the small write problem in redundant disk arrays 
Daniel Stodolsky, Garth Gibson, Mark Holland 

May 1993 ACM SIGARCH Computer Archi tecture News , Proceedings of the 20th 

annual international symposium on Computer architecture ISCA '93, Volume 

21 Issue 2 

Publisher: ACM Press 

Full text available- f0 odfd 35 MB) Additional Information: full citation , abstract , references , citings, index 
' kS j5L - L " terms 

Parity encoded redundant disk arrays provide highly reliable, cost effective secondary 
storage with high performance for read accesses and large write accesses. Their 
performance on small writes, however, is much worse than mirrored disks— the 
traditional, highly reliable, but expensive organization for secondary storage. 
Unfortunately, small writes are a substantial portion of the I/O workload of many 
important, demanding applications such as on-line transaction processing. This paper ... 

16 Session 2B: Sorting and searching in the presence of memory faults (without 
redundancy ) 

Irene Finocchi, Giuseppe F. Italiano 

June 2004 Proceedings of the thirty-sixth annual ACM symposium on Theory of 
computing STOC '04 

Publisher: ACM Press 

Full text available: *Q pdf(21 9.59 KB ) Additional Information: full citation , abstract , references , index terms 

We investigate the design of algorithms resilient to memory faults, i. e., algorithms that, 
despite the corruption of some memory values during their execution, are able to produce 
a correct output on the set of uncorrupted values. In this framework, we consider two 
fundamental problems: sorting and searching. In particular, we prove that any 0(nlog n) 
comparison-based sorting algorithm can tolerate at most 0((nlog n) 1/2 ) memory faults. 
Furthermore, we present one comparison-based sor ... 

Keywords: combinatorial algorithms, memory faults, memory models, searching, sorting 



17 ReVive: cost-effective architectural support for rollback recovery in shared-memory Q 
<H> multiprocessors 

^ Milos Prvulovic, Zheng Zhang, Josep Torrellas 

May 2002 ACM SIGARCH Computer Architecture News , Proceedings of the 29th 
annual international symposium on Computer architecture ISCA '02 , 
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Proceedings of the 29th annual international symposium on Computer 
architecture ISCA '02, volume 30 issue 2 
Publisher: IEEE Computer Society, ACM Press 

Full text available: ^ pdf(138 m) ||p Additional Information: full citation , abstract , references , citings, index 
Publisher Site 

This paper presents ReVive, a novel general-purpose rollback recovery mechanism for 
shared-memory multiprocessors. ReVive carefully balances the conflicting requirements of 
availability, performance, and hardware cost. ReVive performs checkpointing, logging, 
and distributed parity protection, all memory-based. It enables recovery from a wide class 
of errors, including the permanent loss of an entire node. To maintain high performance, 
ReVive includes specialized hardware that performs frequent o ... 

Keywords: fault tolerance, shared-memory multiprocessors, rollback recovery, recovery, 
BER, logging, parity, checkpointing, availability 

1 8 Some aspects of the GOTHIC system Q 
J. P. Banatre, M. Banatre 

September 1986 Proceedings of the 2nd workshop on Making distributed systems 
work EW 2 

Publisher: ACM Press 

Full text available: Q pdf(250.12 KB) Additional Information: full citation , abstract , references 

This note gives a short presentation of a distributed system currently under development 
{BANA-86a]. This system is structured according to the concept of atomic multi-procedure 
(section 2) and allows the handling of replicated and fragmented objects. The supporting 
architecture is a pluri-processor enriched with stable storage facilities (section 3). 

19 Reliability Issues in Computing System Desig n Q 
B. Randell, P. Lee, P. C. Treleaven 

June 1978 ACM Computing Surveys (CSUR), Volume 10 issue 2 
Publisher: ACM Press 

Full text available: Q pdf(3.95 MB) Additional information: full citation , references , citing s, index terms 



20 Parity logging disk array s 

Daniel Stodolsky, Mark Holland, William V. Courtright, Garth A. Gibson 
August 1994 ACM Transactions on Computer Systems (TOCS), Volume 12 issue 3 
Publisher: ACM Press 

Full text available- pdf(1 98 MB) Additional Information: full citation, abstra ct, references , citings, in dex 
• • terms 

Parity-encoded redundant disk arrays provide highly reliable, cost-effective secondary 
storage with high performance for reads and large writes. Their performance on small 
writes, however, is much worse than mirrored disks— the traditional, highly reliable, but 
expensive organization for secondary storage. Unfortunately, small writes are a 
substantial portion of the I/O workload of many important, demanding applications such 
as on-line transaction processing. This paper presents 
Keywords: RAID, disk arrays 
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Survivability analysis of networked systems 
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Survivability is the ability of a system to continue operating despite the presence of 
abnormal events such as failures and intrusions. Ensuring system survivability has 
increased in importance as critical infrastructures have become heavily dependent on 
computers. In this paper we present a systematic method for performing survivability 
analysis of networked systems. An architect injects failure and intrusion events into a 
system model and then visualizes the effects of the injected event ... 
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Fault tolerance in artificial neural networks is an important feature, in particular when the 
application is critical or when maintenance is difficult. This paper presents a general 
design methodology for designing fault-tolerant architectures, starting from the 
behavioral description of the nominal network and from the nominal algorithm. The 
behavioral level is considered to detect errors due to hardware faults, while system 
survival is guaranteed by the reactivation of learning mechanisms ... 
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December 1985 ACM Computing Surveys (CSUR), Volume 17 issue 4 
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Distributed operating systems have many aspects in common with centralized ones, but 
they also differ in certain ways. This paper is intended as an introduction to distributed 
operating systems, and especially to current university research about them. After a 
discussion of what constitutes a distributed operating system and how it is distinguished 
from a computer network, various key design issues are discussed. Then several 
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This paper presents Revive, a novel general-purpose rollback recovery mechanism for 
shared-memory multiprocessors. ReVive carefully balances the conflicting requirements of 
availability, performance, and hardware cost. ReVive performs checkpointing, logging, 
and distributed parity protection, all memory-based. It enables recovery from a wide class 
of errors, including the permanent loss of an entire node. To maintain high performance, 
ReVive includes specialized hardware that performs frequent o ... 

Keywords: fault tolerance, shared-memory multiprocessors, rollback recovery, recovery, 
BER, logging, parity, checkpointing, availability 
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Enterprise information architectures — they're finally changing 
Wesley P. Melling 

May 1994 ACM SIGMOD Record , Proceedings of the 1994 ACM SIGMOD international 

conference on Management of data SIGMOD '94, volume 23 issue 2 
Publisher: ACM Press 

Full text available: ^pdf(1.28 MB) Additional Information: full citation , abstract , references , index terms 

Substantive changes in the business environment— and aggressive initiatives in business 
process reengineering— are driving corresponding changes in the information technology 
architectures of large enterprises. Those changes are enabled by the convergence of a 
long list of maturing new technologies. As one of its many implications, the new IT 
architecture demands revised assumptions about the design and deployment of 
databases. This paper reviews the components of the architecture ... 

Microcode implemented General Modular Redundancy 
F. P. Mathur, P. T. de Sousa 

September 1974 Conference record of the 7th annual workshop on Microprogramming 
MICRO 7 

Publisher: ACM Press 

Full text available: ^| pdf(519.37 KB) Additional Information: full citation , abstract , references , index terms 

First the concepts of protective redundancy are described in the unified framework called 
General Modular Redundancy (GMR). GMR is a unified framework which synthesizes all 
the major redundancy techniques known. An alternative to an exclusively hardware 
implementation is by means of an extension to the Wensleyian Software Implemented 
Fault-Tolerance (SIFT) approach. A more attractive alternative, an implementation in 
microcode, is proposed and described here. 
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A taxonomy of computer pro g ram security flaws 
Carl E. Landwehr, Alan R. Bull, John P. McDermott, William S. Choi 
September 1994 ACM Computing Surveys (CSUR), volume 26 issue 3 
Publisher: ACM Press 
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An organized record of actual flaws can be useful to computer system designers, 
programmers, analysts, administrators, and users. This survey provides a taxonomy for 
computer program security flaws, with an Appendix that documents 50 actual security 
flaws. These flaws have all been described previously in the open literature, but in widely 
separated places. For those new to the field of computer security, they provide a good 
introduction to the characteristics of security flaws and how they ... 

Keywords: error/defect classification, security flaw, taxonomy 



10 Risks to the public: Risks to the public 
Peter G. Neumann 

May 2005 ACM SIGSOFT Software Engineering Notes, volume 30 issue 3 
Publisher: ACM Press 

Full text available: pdfd 77.87 KB) Additional Information: full citation , abstract , index terms 

Edited by Peter G. Neumann (Risks Forum Moderator and Chairman of the ACM 
Committee on Computers and Public Policy), plus personal contributions by others, as 
indicated. Opinions expressed are individual rather than organizational, and all of the 
usual disclaimers apply. We address problems relating to software, hardware, people, and 
other circumstances relating to computer systems. To economize on space, we include 
pointers to items in the online Risks Forum: (R i j) denotes RISKS vol i number ... 

11 Structured programming using p rocesses 
Jay Nelson 

September 2004 Proceedings of the 2004 ACM SIGPLAN workshop on Erlang ERLANG 
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Publisher: ACM Press 

Full text available- f 51 ! pdfd 16.57 KB) Additional Information: full citation , abstract , references , citings, index 
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Structured Programming techniques are applied to a personal accounting software 
application implemented in erlang as a demonstration of the utility of processes as design 
constructs. Techniques for enforcing strong encapsulation, partitioning for fault isolation 
and data flow instrumentation, reusing code, abstracting and adapting interfaces, 
simulating Inputs, managing and distributing resources and creating complex application 
behavior are described. The concept of inductive decomposition 
Keywords: COPL, concurrency oriented programming language, erlang, inductive 
decomposition 
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The article proposes a scalable protocol for replication management in large-scale 
replicated systems. The protocol organizes sites and data replicas into a tree-structured, 
hierarchical cluster architecture. The basic idea of the protocol is to accomplish the 
complex task of updating replicated data with a very large number of replicas by a set of 
related but independently committed transactions. Each transaction is responsible for 
updating replicas in exactly one cluster and invoking add ... 

Keywords: data replication, large-scale systems, multiview access 
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Research can rarely be performed on large-scale, distributed systems at the level of 
thousands of workstations. In this paper, we describe the motivating constraints, design 
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principles, and architecture for an extensible, distributed system operating in such an 
environment. The constraints include continuous operation, dynamic system evolution, 
and integration with extant systems. The Information Bus, our solution, is a novel 
synthesis of four design principles: core communication protoco ... 
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July 2001 Proceedings of the 23rd International Conference on Software 
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We report on experience that we made in the Trading room InteGRation Architecture 
project (TIGRA) at a large German bank. TIGRA developed a distributed system 
architecture for integrating different financial front-office trading systems with middle- 
and back-office applications. We generalize the experience by proposing an architectural 
style that can be re-used for similar enterprise application integration tasks. The TIGRA 
style is based on a separation of data representation using domain-s ... 
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November 1994 Proceedings of the 1994 ACM/IEEE conference on Supercomputing 
Supercomputing '94 
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Full text available: g pdf(719.76 KB) Additional Information: full citation , abstract , references 

The VPP500 vector parallel processor is a highly parallel, distributed memory 
supercomputer that has a performance range of 6.4 to 355 gigaFLOPS and a mai n 
memory capacity from 1 to 222 gigabytes. The system scalably supports between 4 and 
222 processors interconnected by a high-bandwidth crossbar network.Three key aspects 
of the VPP500, which are in sharp contrast to current massively parallel systems, 
characterize its architecture. First the building block is a 1.6 gigaFLOPS vector processor 
t ... 

20 Charles W. Bachman interview: September 25-26, 2004; Tucson, Arizona 
Thomas Haigh 

January 2006 ACM Oral History interviews 
Publisher: ACM Press 

Full text available: *g) pdf(761.66 KB) Additional Information: full citation , abstract 

Charles W. Bachman reviews his career. Born during 1924 in Kansas, Bachman attended 
high school in East Lansing, Michigan before joining the Army Anti Aircraft Artillery Corp, 
with which he spent two years in the Southwest Pacific Theater, during World War II. 
After his discharge from the military, Bachman earned a B.Sc. in Mechanical Engineering 
in 1948, followed immediately by an M.Sc. in the same discipline, from the University of 
Pennsylvania. On graduation, he went to work for Do ... 
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